Goto

Collaborating Authors

 Valencia











InvestigatingGenderBiasinLanguageModels UsingCausalMediationAnalysis

Neural Information Processing Systems

A popular class of analysis methods, often called structural analysis, aims to extract this information using probing classifiers that predict linguistic properties from representations of trained models (e.g., Adi et al., 2017; Conneau et al., 2018; Hupkes et al., 2018; Tenney et al., 2019).